Picture for Xiangfeng Wang

Xiangfeng Wang

Mean-Field Diffuser: Scaling Offline MARL to Thousands of Agents

Add code
May 28, 2026
Viaarxiv icon

OptSkills: Learning Generalizable Optimization Skills from Problem Archetypes via Cluster-Based Distillation

Add code
May 28, 2026
Viaarxiv icon

AgentSchool: An LLM-Powered Multi-Agent Simulation for Education

Add code
May 28, 2026
Viaarxiv icon

AscendOptimizer: Episodic Agent for Ascend NPU Operator Optimization

Add code
Mar 24, 2026
Viaarxiv icon

It Takes Two to Tango: A Holistic Simulator for Joint Order Scheduling and Multi-Agent Path Finding in Robotic Warehouses

Add code
Feb 15, 2026
Viaarxiv icon

PRIME: A Process-Outcome Alignment Benchmark for Verifiable Reasoning in Mathematics and Engineering

Add code
Feb 12, 2026
Viaarxiv icon

Step 3.5 Flash: Open Frontier-Level Intelligence with 11B Active Parameters

Add code
Feb 11, 2026
Viaarxiv icon

See, Plan, Snap: Evaluating Multimodal GUI Agents in Scratch

Add code
Feb 11, 2026
Viaarxiv icon

R-Align: Enhancing Generative Reward Models through Rationale-Centric Meta-Judging

Add code
Feb 06, 2026
Viaarxiv icon

STEP3-VL-10B Technical Report

Add code
Jan 15, 2026
Viaarxiv icon